Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 0917520020090010207
Journal of Speech Sciences
2002 Volume.9 No. 1 p.207 ~ p.215
Perceptual Evaluation of Duration Models in Spoken Korean
Chung Hyun-Song
Abstract
Perceptual evaluation of duration models of spoken Korean was carried out bvased on the Classification and Regression Tree (CART) model for text-to-speech conversion. A reference set of durations was produced by a commercial text-to-speech synthesis system for comparison. The duration model which was built in the previous research (Chung & Huckvale, 2001) was applied to a Korean language speech synthesis diphone database, "Hanmal (HN 1.0)". Tfhe synthetic speech produced by the CART duration model was preferred in the subjective preference test by a small margin and the synthetic speech from the commercial system was superior in the clarity test. In the course of preparing the experiment, a labeled database of spoken Korean with 670 sentences was constructed. As a result of the experiment, a trained duration model for speech synthesis was obtained. Tfhe "Hanmal" diphone database for Korean speech synthesis was also developed as a by-product of the perceptual evaluation.
KEYWORD
FullTexts / Linksout information
Listed journal information